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1 Introduction 

In ^3] Shor proved equivalence of several open (sub) additivity problems 
related to the Holevo capacity and the entanglement of formation (EoF). In 
jHj we showed equivalence of these to the additivity of the Holevo capacity 
for channels with arbitrary linear constraints. 

This note is the development of [H] in the direction of channels with gen- 
eral constraints. Introducing input constraints provides greater flexibility in 
the treatment of the additivity conjecture. On the other hand, while [To] . 
jH] deal with the "global" additivity conjecture, i.e. properties valid for all 
possible channels, in this paper we make emphasis on results valid for indi- 
vidual channels. The Holevo capacity of the arbitrarily constrained channel 
is considered and the characteristic property of an optimal ensemble for such 
channel is derived (proposition 1), generalizing the maximal distance prop- 
erty of Schumacher and Westmoreland ^3] ■ This property provides a useful 
estimate for the Holevo capacity of the constrained channel (proposition 2). 
Our main point of interest is the additivity conjecture for two arbitrarily 
constrained channels and its relations to Shor's channel extension [15 . An 
attempt to prove converse of the theorem 1 in jHj, in which sufficient condition 
for " constrained additivity" was formulated, leads to the notion of asymptotic 
additivity for sequences of channels. This notion applied to the sequences 
of Shor's channel extensions makes it possible to formulate the main result 
of |H] in the form of necessary and sufficient condition. This implies that 
the additivity conjecture for two channels with single linear constraints is 
equivalent to the similar conjecture for two arbitrarily constrained channels 
and, hence, to an interesting subadditivity property of the x-function for the 
tensor product of these channels. 
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This subadditivity property of the x-function seems to be very appealing. 
It implies the unconstrained additivity and can be used for proving the latter 
in some cases. This property can be established for several types of channels 
(proposition 4). The characteristic property of the optimal ensembles for 
constrained channels provides necessary and sufficient condition for additivity 
of the Holevo capacity for two channels, constrained by fixing partial states 
of average state of input ensemble (theorem 2). This leads to an interesting 
characterization of the channels for which subadditivity of the ^-function 
holds (corollary 5). 

Global subadditivity of the x-function follows obviously from the strong 
superadditivity of the entanglement of formation and from the MSW channel 
representation jH]. But it turns out that the converse is also true (corollary 
6). With the corollary 3 this gives another (in comparison to |15j ) way 
of proving that the global additivity conjecture for unconstrained channels 
implies strong superadditivity of the entanglement of formation. 

The arguments from the convex analysis raised in |2] provide another 
characterization of channels for which subadditivity of the ^-function holds 
(theorem 3). This characterization and some modification of Shor's chan- 
nel extension provides a simple way of proving that global additivity of the 
minimum output entropy for unconstrained channels implies global subad- 
ditivity of the x-function and strong superadditivity of the entanglement of 
formation. 

2 Channels with arbitrary constraints 

Let $ : i— > &{7i') be an arbitrary channel (here are finite 

dimensional Hilbert spaces). Let {p{\ be an arbitrary ensemble of states 
in (5(7i) with the probability distribution {vrj}, which will be denoted as 
{Ki,pi}. 

The Holevo quantity [3] for this ensemble is defined by 




For an arbitrary state p we denote 



X*(p) = max x<s>({TTi, pi}) 
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where the maximum is over all ensembles {iii, p^ with Y^i^iPi = P- The 
function x${p) (briefly x-function) is continuous and concave on &(TC). The 
concavity of x<s>{p) easily follows from the definition. The continuity of x#G°) 
can be derived from the MSW-correspondence [S] and the continuity of the 
entanglement of formation [5]. 

Fixing a closed subset A of ©(H) we can consider the constraints on the 
input ensemble {7^, p^ of the channel $ defined by the requirement p av G A, 
where p av = J2i ^iPi i s the average state of the ensemble. This type of 
constraints is a natural generalization of linear constraints considered in jl] , 
where the subset A is defined by the inequality TrAp av < a for some positive 
operator A and positive number a. 

Define the Holevo capacity of the ^.-constrained channel $ by 

C($;A) = maxxM = max X*({n,Pi})- (1) 

PG.4 £7riPiG.A 

Note that the unconstrained capacity C(&) = C($; &{7i)). 

Any ensemble {iii, Pi} on which the maximum in ([I]) is achieved is called 
an optimal ensemble for the A -constrained channel $. In ^3] it was shown 
that an optimal ensemble {tt^, p^} (with the average state p av ) for the uncon- 
strained channel $ is characterized the by the maximal distance property: 

S($M||$( Pav )) < X 0({n,Pi}), Vcu g &(H). 

The generalization of the above property for the constrained channels is given 
by the following proposition. 

Proposition 1. The ensemble {7Tj,pj} with the average state p av G A is 
optimal for the ^.-constrained channel $ with a convex set ^4. if and only if 

^^S($(^)||$(p av )) < XsiUhPi}) 
j 

for any ensemble {p,j,ujj} of states in 6(7i) with the average state u; av G A 
Proof. The proof is a simple generalization of the arguments in ^3] • The 
specific feature of the "constrained" situation consists in the necessity to 
consider variations of the initial ensemble by mixing not only one state as in 
[T3| . but a genuine ensemble. 

Let {vrj,pj}™ =1 and {pj, ujj}™ =1 be two ensembles with average states p av 
and c<j av , contained in A. Consider the modification of the first ensemble by 
adding the second one with the weight coefficient rj. The modified ensemble 

{(1 - f])n 1 p 1 , (1 - r])ir n p n , r/pi^i, rjp m u m } 
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has the average state p\ = (1 — r])p av + r]u av , contained in A (by convexity). 
Let x<s> an d xl be the Holevo quantities for the original and the modified 
ensembles correspondingly Using the relative entropy expression for the 
Holevo quantity [HJ we have 



Xl = (1 - rj) E \MpD) + V £ H $ (Pav))- (2) 



i=l 



Applying Donald's identity to the original ensemble we obtain 

n 

i=i 

Substitution of the above expression into (j2j) gives 



xl = X* + V 



.j=i 



+ (l-Tj)S(Q(p w )\\*(fil r )). (3) 



Applying Donald's identity to the modified ensemble we obtain 

n m 

(i - n) Ensm Pi )\mp av )) + v E^(^(^-)P(Pav)) 

= xi + s(Hpi)\m Pav )) 



and hence 



.i=i 



WpZJHsGU). (4) 



Since the relative entropy is nonnegative the expressions (JHJ) and Q imply 
the following double inequality for the value = X$ — X* : 



< a x $ < ; 

m 

E^(^)ll*(Pav))-X* 



(5) 
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Now the proof of the proposition is straightforward. If 

£^£(*(wi)||*0w))<X« 

3 

for any ensemble {fij,Uj} of states in &(TC) with the average state cj av G A, 
then by (0) with rj = 1 we have 

X*({^i:^i}) = X$ < X<£ = X$({n,Pi}), 

which means optimality of the ensemble {7Tj,pj}. 

To prove the converse, suppose {iii, Pi} is an optimal ensemble and there 
exists an ensemble {fij,Uj} such that 

3 

By continuity of the relative entropy there is rj > such that 

j2»jSmcu j )\m P i))>x*, 

because p![ v tends to p av when 77 tends to zero. By the first inequality in JH}, 
the last inequality means that x\ > X* m contradiction with the optimality 
of the ensemble {7^, p.;}. A 

The above characteristic of an optimal ensemble provides the following 
estimate for the Holevo capacity of the constrained channel. 

Proposition 2. Let p av be the average state of any optimal ensemble for 
the ^.-constrained channel $ with a closed convex subset A of &(TC). Then 

C(<&; A) = x*(Pav) > xM + S($(p)\\$(p w )), Vp e A. 

Proof. Let {7Tj,pj} be an arbitrary ensemble such that Yli^iPi = P £ A. By 
proposition 1 

^^(^(pOII^Pav)) < X$(Pav). 

This inequality and Donald's identity 

^7r ? ;5($(p i )||$(p av )) = x*({7r l ,p i }) + 5($(p)||$(p av )). 

i 

complete the proof. A 
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3 Shor's channel extension and constrained 



channels 

Let ^ : <5(/C) i— > <5(/C') be another channel with the constraint, defined 
by a closed subset B of &(JC). For the channel $ © \l/ it is natural to con- 
sider the constraint defined by the requirements a® v = Tr^a av G A and 
cr* = Tpft<7 av G £> on the average state cr av of the input ensemble {7^,0^}. 
The closed subset of &(H © /C) defined by the above requirements will be 
denoted as ^4. © £>. 

We conjecture the following additivity property for constrained channels 

C($© ^;A®B) = C($;A) + C(V;J3). (6) 

Note that additivity of the Holevo capacity is a partcular case of (JHJ) with 
A= &(H) and B = 6(/C). 

In jH] the role of Shor's channel extension ^5] m connection with the 
constrained additivity problem was demonstrated. For reader's convenience 
we give here a brief description of this notion. 

Let E be an operator in 03(H), < E < I H (the identity operator in 
the space TC), let q G [0; 1] and d G N = {1,2,...}. Consider the channel 
$(E,q,d), which maps 53(H) <g> C d into 03(H') © C d+1 , where C d is the 
commutative algebra of complex <i-dimensional vectors describing a classical 
system. By using the isomorphism of 03(H) ® C d with the direct sum of 
d copies of 03 (H), any state in 03 (H) ® C ' can be represented as an array 
{pj}^ =1 of positive operators in 03(H) such that TrJ^ =1 pj = 1. The action 

of the channel $(E, q, d) on the state p = {pj} d =l with p = Y^=i Pj * s defined 
by 

q, d){p) = (1 - g)$ (p) © g$i(p), 

where $ (p) = $(p) G 6(H') and ^(p) = [Trp£, Trp^, ...,Trp d £] G C d+1 
(throughout this paper we use the notation A = I — A for operators). Note 
that $0 and $1 are channels from 23(H) © C ' to 03 (H') and to C d+1 
correspondingly. The state space of the channel $(E, q, d) will be denoted 
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We will need some generalization of the proposition 1 in j^j. 
Proposition 3. Let \I/ : <5(/C) i— > S(/C') be an arbitrary B -constrained 
channel. Consider the channel <$>(E, q, d) ® \P. Then 

g, d) Cg> 6$ <g> fi) - max [(l-g)x*®»W + glogrfIt(£ ® J*) a] 

< g(logdim/C' + 1). 

Proof. Due to the representation 

$(E, q,d)®^ = ((1 - g)$ <8> *) © (g) *) , (7) 

the lemma 1 in [H] reduces the calculation of the Holevo quantity 9 
for any ensemble of input states to the calculation of the Holevo quantities 
X$ fe ®*> k = 0,1 for this ensemble: 

X$>(E,q,d)®if = (1 ~~ ?)X*0®* + <?X*i®*- ( 8 ) 

Note that any state <r in 05 (W) (g> C d <g> 05 (/C) can be represented as an 
array {crj}| =1 of positive operators in 05(7i (g) /C) such that Tr Y^=i cr j = 1- 
Denote by 5j(cx) the array a with the state a in the j-th position and with 
zeroes in other places. 

It is known [13J that for any channel there exists a pure state optimal en- 
semble. This fact and symmetry arguments imply the existence of an optimal 
ensemble for the channel Q(E, q,d) <g) \I/ consisting of the states a^j = Sj(o~i) 
with the probabilities fiij = d' 1 ^, where {/ij, o~i\ is a particular ensemble of 
states in &(TC<S>JC) (cf. [IS]). Let cr av = Yli jV-ij^hj anc ^ a ^ = Y^if 1 ^ be 
the average states of these ensembles. Note that a av = [d _1 cr av , <i -1 cr av ]. 

The action of the channel $o <g> ^ on the state a = {cj}^ =1 with 
a = X]f=i °"i i s defined by 

$ <g> *(cr) = $ (g) *(cr). 
Hence $ <g> ^(S^j) = * <8> *(c>i) and 

Let us prove that 

= logrfTr(E ® 7/c)cr av + /#({//», a*}), (10) 
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where f^dfiuCi}) < log dim /C' +1. It is easy to see that the action of the 
channel $1 ® \I/ on the state a = {<Ji}f =1 with a = Ei=i °% i s defined by 

$! <g> = [*e((7), * B («7i), ^bM], 

where ^a(") = Tr^ (A <g> • (Id ® ^)(-)) is a completely positive map from 
&(H ® /C) into «8+( £') (A — E,E and Id is an identity map on 6(H)). 
Therefore, 

® V(a id )) = H(^ B (ai)) + H(V E (<Ti)), (11) 

and 

= [%((7 av ), tT 1 * jg(<7 av ), d-^B^av)], 

Due to this we can conclude that 

H($i (g> ^((T av )) = logd W £ K v ) + iJ(^((7 av ))+i7(^K v )). (12) 
It is easy to see that 

M £ (a) = Tr(£ ® 7^) a (13) 
Using dllj), CD and JTJJ), we obtain 

X*i®*({ft,i^ij}) = logdTr(E ® i*;)cr av + 
#(* £ (<x av )) + #(^(<7 av )) -E^(^(*s(<Ti)) + #(^(a 4 ))) (14) 

Using the inequalities < H(S) < TrS'(logdim?Y — logTrS 1 ) for any positive 
operator S G B{7i) and h,2(x) = x logs + (1 — x) log(l — x) < 1 it is possible 
to show that 

f£(.{tk,<*i}) = X^ E i.{^ a i}) + Xv s ({Hi,<7i}) < logdim/C' + l. (15) 

Applying this estimation to (|14|) we obtain ()10p. 
The expression (JHJ) with © and (jl(Jj) imply 

X$( J B,g,d)(xD*({^j>^,i}) = ( 1 - 9)x*o®*({^,i>^ij}) + 9X*i®*({^i,i>^i,i}) 

= (1 - g)x*®*({/ii, «7i}) + <? log dTro- av (E ® I K ) + qfi({fH, <?%})■ 
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The last equality with (|15|) completes the proof. A 

When dealing with Shor's channel extension in connection with the con- 
strained additivity problem it is convenient to slightly change notation. For 
nonnegative number p and operator < A < I we will denote by $d(Ap) 
the Shor's extension $>(A,p/ log d, d) of the channel $. 

Definition 1. We say that the additivity conjecture holds asymptotically 
for the sequence of unconstrained channels {$ n : &(H n ) i— > &(H.' n )} and the 
channel \& : ©(/C) i— > &{JC') with the constraint defined by the set B if 

lim C{<b n ®^,&{H n )®B) = lim C($ n ) + C(*,B), 

n — >+oo n— >+oo 

assuming that the limits exist and are finite. 

Note that if the additivity conjecture holds for the channels $ n and $ 
for all sufficiently large n it obviously holds asymptotically for the sequence 
{$„} and the channel 

Theorem 1. Let $ : &(H) i-> &{W) and # : &{K) ^ &{K') be 
arbitrary channels with the fixed constraint on the second one defined by the 
set B. The following statements are equivalent: 

(i) The additivity conjecture is true for the channel $ with the constraint 
TrAp < a for arbitrary (A, a) and the £>-constrained channel 

(ii) The additivity conjecture holds asymptotically for the sequence of the 
channels {^d(A,p)}deN (without constraints) and the fi-constrained 
channel \I/ for arbitrary nonnegative number p and arbitrary operator 
< A < I; 

(iii) The additivity conjecture is true for the „4-constrained channel $ with 
arbitrary closed subset A in <5(7i) and the £>-constrained channel \1/ . 

Proof. The implication (iii) =^ (i) is obvious. So it is sufficient to prove 
(i) (ii) and (ii) (iii). 

Note, first of all, that for an operator < A < I and a number p > 
proposition 3 implies 

lim C($ d (A,p)) = max[x$(p) +pTrAp] (16) 

ci^+oo p 

and 

lim C($ d (A,p) ® *) = max [x$®*(a) + pTr(A ® I K ) a] (17) 
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correspondingly. 

Begin with (i) =^> (ii). Suppose that there exist an operator < A < I 
and a number p > such that the additivity conjecture is not asymptotically 
true for the sequence {&d(A,p)}d e ?q and the channel Due to (fTo^) and (fTT|) 
it means that 

max [x$®*(cr) + pTr(A ® I K ) a] > max \x*(p) + pTrAp] + C(V,&). (18) 

Let a* be a maximum point in the left side of the above inequality and 
a = Ty(A eg) I K ) cr*. By the statement (i) the additivity conjecture is true 
for the channel $ with the constraint TrAp < 1 — a and the £>-constrained 
channel So there exist such states a* and cr* G i3 that TrAa* > a and 
> • Hence 

max [x$®*(cr) + pTi(A <g> J/c) cr] = x*®*^*) + pTr(A <g> I K ) cr* 
< + + pTrAa* < max \xM + pTiAp] + B) 

in contradiction with (JTSJ). 

The proof of (ii) =>- (iii) consists of two steps. First we will prove the 
statement (iii) with the set A defined by the system of n inequalities 
TrAkp < ak, k — l,n with arbitrary arrays of operators {^4fc}fc =1 and num- 
bers {ak}k=i such that < A k < I and < a k < 1 for k — l,n. Then we 
will pass to an arbitrary set A. 

Let A be a set of the above special type. Suppose that the interior of 
the set A is nonempty. It implies that there exist such a state p that 
Tr Akp < CKfc, k = 1, n. It is sufficient to show that 

C(<S>®^;A®B) <C($;A)+C($;B). (19) 

Suppose, " > " takes place in (JT§j) . Then, there exists an ensemble {/i^cr,} 
in (5(H (g) /C) with the average state cr av , such that TrA k a® v < a k , k — l,n, 
cr* G B and 

^}) > C($; A) + ; B). (20) 

Let p av be the average of the optimal ensemble for the ^4-constrained 
channel $ so that C($;A) = X<b{Pav)- 

Note, that the state p av is the point of maximum of the concave function 
with the constraints TrA k p < a k ,k = l,n. By the Kuhn- Tucker 
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theorem J7J 1 there exists a set of nonnegative numbers {p k } k=1) sucn that p av 
is the point of the global maximum of the function x<s>(p) ~ Sfc=i PfcTr-A^p 
and the following conditions hold 



p k (TrA k p av - ot k ) = 0; k = 1, n. (21) 

It is clear that p av is also the point of the global maximum of the concave 
function + Sfc=iPfcTrA fc p, so that 

n n 

xM + J^PfcTrAfcp < x$(Pav) + ^p fc TrA fc p av , Vp G 6(H). (22) 

k=l k=l 

Let p = || X^liPiAH- The case p = means that p av is the point of 
the global maximum of x<s>{p)- So in this case the channel $ is in fact 
unconstrained. But the additivity for the unconstrained channel <3> and the 
B -constrained channel \I/ easily follows from the asymptotic additivity for 
the sequence $d(A 0) (with arbitrary operator A) and the S-constrained 
channel because each channel in this sequence is equivalent to 

In the case p > let A = V^YTi=\Vi^i- Note that < A < I. Con- 
sider the sequence $^(A,p) of Shor's extensions of the channel Assumed 
asymptotic additivity together with ()16|) and ()17j) implies 

max[x$^(a) + pTr(A <g> I K ) a] = max[x$(p) +pTrAp] +C(V;B). (23) 



p 



Due to d2TJ) and fl22J) we have 



n 

xM + T,Pk TrA k p 
k=i 



max [x$(p) + pTrAp] = max 

p p 

n _ _ n 

= X$(Pav) + E PfcTrA fc p av = C($; .A) + £ PaC 1 _ 
fe=i fe=i 



(24) 



Noting that 

Ti(A k ® fx;) o- av = TrAfefj* < a fc , = T~7i, 

we have by (|2T?|) 

max [x$®*(o") + pTr(A <g> J/c) cr] > x*®*(cr av ) + pTr(A ® J^) a av 

cr 

n _ _ _ n 

= X*®*(cr av )+ £PfcTr(A fc ®/,c)<7 av > C($; A) + C^; #) + £ p fc (l - a k ) 

k=l k=l 



1 We use the strong version of this theorem with the Slater condition, which follows 
from the assumption that the interior of the set A is nonempty. 
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The contradiction of the last inequality with (|23|) and (J24)) completes the first 
step of the proof of (ii) =>■ (iii) under the condition that the interior of the 
set A is nonempty. 

The case where the set A has no inner point may be reduced to the 
previous one with the help of the following lemma. 

Lemma 1. Let {*A n } ne N be a decreasing sequence of closed subsets of 
&(7i.) and A = OneN^n- ^ the additivity conjecture is true for the A n - 
constrained channel $ and the fi-constrained channel \l/ for all n G N then 
this conjecture is true for the .A-constrained channel $ and the £>-constrained 
channel \l/ as well. 

Proof. It is sufficient to prove that 

C($,A)= lim C(<S>,An), C(<S>®ty,A®B) = lim (7($ ® An ® B). 

n— >+oo n— >+oo 

Due to A ® B = HneN ® B the second equality is equivalent to the first. 

The nonnegative sequence A n ) is decreasing, so the first of the above 
limits exists and " < " is obvious. For each n G N let p n be the maximum 
point of the function with the constraint p G A n . By compactness 

argument we can assume that there exists lim^+oo p n = G A. The conti- 
nuity property of the function x<s>{p) gives lim^+oo x*(Pn) = which 
proves the " = " .A 

Consider the sequence {A m } of the subsets in &(H), in which subset A m 
is defined by the system of inequalities TrA k p < + l/m, k — l,n. The 
interior of each A m is nonempty. By the previous consideration the additivity 
conjecture is true for the A m -constrained channel $ and the B-constrained 
channel \1/ . The lemma 1 gives desired additivity for the ^-constrained 
channel <3> and the £>-constrained channel 

Now the proof of the additivity for arbitrary set A is very simple. Note 
that any state p in &(H) may be considered as a set A p = {p}, defined by a 
finite number of linear inequalities. The result of the first step implies 

C ($ ® *; A p ® B) = A p ) + C(V; B) = X M + £(* ; B). (25) 

Suppose that the additivity conjecture is not true for the A -constrained 
channel $ and the ^-constrained channel Then there exists such a state 
a G 6(H ® K) that cr* G A, cr* G B and 



12 



This inequality contradicts to (|25|) with p = a . The proof of (ii) =^> (hi) is 
complete. A 

The above theorem implies two sorts of results. The equivalence 
( ii ) -v^ (hi) implies the corollaries 1-3. The equivalence (i) <S=> (hi) gives 
the corollary 4. 

For the verification of the constrained additivity the following sufficient 
condition will be convenient (see the proof of the proposition 4(B)). 

Corollary 1. If the additivity conjecture holds true for the unconstrained 
channels $>d{A,p) with arbitrary pair (A,p) and the £>-constrained channel 
\1/ for all sufficiently large d, then the additivity conjecture is true for the 
A -constrained channel $ and the S-constrained channel \1/ with arbitrary 
Ac &(H). 

Corollary 2. The additivity for the Shor's channel extensions $d(Ap) 
and ty e (B,r) with arbitrary pairs {A,p) and (B,r) for all sufficiently large 
d and e implies the additivity for the ^4-constrained channel $ and the 
£>-constrained channel \1/ with arbitrary A C &(H) and B C <5(/C). 
Proof. This is obtained by double application of the corollary 1. A 
Corollary 3. If the additivity conjecture for any two unconstrained 
channels holds true then it holds for any two channels with arbitrary con- 
straints. 

Proof. This follows from corollary 2. A 

Corollary 4. If the additivity conjecture holds true for the channels $ 
and \1/ with the single linear constraints TrpA < a and TrgB < (3 corre- 
spondingly then the additivity conjecture holds true for the ^4-constrained 
channel $ and the £>-constrained channel ^ with arbitrary A C &(H) and 
B C &{K). 

Proof. Double application of the equivalence (i) <^> (hi) in the above 
theorem. A 
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4 Subadditivity property of the ^-function 



The additivity of the Holevo capacity for the channels $ and ^ with 
arbitrary constraints implies the following subadditivity property of the 
X-function: 

X*M°)<X*(<r*) + Xv(<r*), V<re6(W®/C). (26) 
To see this it is sufficient to take A = {cr*}, B = {cr*} and note that 

C($®*;{a*}®{(T 9 })>x*v*(<T), 
(7($; {a*}) = X9 (a% {a*}) = 

The subadditivity of the function x$,g)*(cr) implies its additivity: 

*W* ® a*) = + x*(<t*), Va* G 6(W), Va* G 6(/C), 

which is not obvious as well. 

Subadditivity of the function x<s>®v(. (J ) obviously implies the additivity of 
the Holevo capacity for these channels. On the other hand, it is equivalent to 
the additivity conjecture for the channels $ and ^ with arbitrary constraints, 
and, hence, by corollary 4, with arbitrary single linear constraints. Corollary 
3 shows that global subadditivity of the x-function is equivalent to the global 
additivity of the Holevo capacity for unconstrained channels. However, it is 
not clear whether ()26|) is implied by the additivity of the Holevo capacity for 
the channels <3> and \&. 

There are several cases, where (|26|) can be indeed established. 

Proposition 4. Let \P be arbitrary channel. The inequality (|26|) holds 
in each of the following cases: 

(A) $ is an entanglement breaking channel; 

(B) $ is a noiseless channel; 

(C) $ is a direct sum mixture of a noiseless channel Id and a channel $o 
such that the function x$ <x>* i s subadditive. 

Proof. (A) Shor in (TB] proved the additivity conjecture for two uncon- 
strained channel if one of them is an entanglement breaking. But in the 
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proof of this theorem the subadditivity property of the x -function was in 
fact established. It is interesting that in this case we can directly deduce the 
subadditivity of the x-function from unconstrained additivity with the help 
of corollary 2. One should only verify that entanglement breaking property 
of any channel implies similar property of Shor's extension for this channel. 

(B) The proof of this statement consists of two steps. First we will prove 
the additivity conjecture for two channels if one of them is noiseless and {p}- 
constrained while the other channel is arbitrary and unconstrained. Then we 
will apply the Shor's channel extension to pass to the {^}-constrained second 
channel. 

The proof of the first step is the modification of the proof in j3] of the 
"unconstrained" additivity for two channels if one of them is noiseless, which 
is based on the Groenevold-Lindblad-Ozawa inequality ^1] 

H(a) < J2Pj H (°j), (27) 

j 

where a is a state of a quantum system before measurement, Oj is the state 
of this system after measurement with yield j and pj is the probability of 
this yield. 

Let $ = Id and p be an arbitrary state in &(7i). We want to prove that 

C(ld ® {p} ® ©(£)) = C(Id, {p}) + C(V, ©(£)) = H(p) + C(V) (28) 

Let {/ij, CTj} be an ensemble of states in &(TC ® K) with ^ p^crf = p. By 
subadditivity of quantum entropy we obtain 

Xid®*({Mi, cri}) = if (Id <g> Wi)) ~ £ Mi#(Id ® *(<7i)) 

i i 

(29) 

< H(p) + ff(*(£ i^af)) - £ (MHild ® *(«)). 

i i 

Consider the measurement, defined by the observable {|ej)(ej-| ® I/c}, where 
{|ej)} is an orthonormal basis in TC. By (J?7]) we obtain 

H(Id *(^)) < ^,-ff (*(4)), Vz, 
i 

where p^- = (ej\<Ti\ej) and cr^- = p^- |ej-)(e 3 -| <8> Ijc • o% • l e i)( e jl ® ^c- Note that 
^2jPij<yfj = of. This and previous inequality show that two last terms in 
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do not exceed XvdfcPij) a fj}) an d> hence, C(^f). With this observation 
()29|) implies (|25|) and, hence, the first step of the proof is complete. 

It follows that the additivity conjecture holds for the (unconstrained) 
Shor's extension of the channel \l/ (with arbitrary parameters) and the {p}- 
constrained channel $ = Id. The corollary 1 gives the desired subadditivity 
property of the function Xw®*- 

(C) Let $ g = gld © (1 — q)<&Q. For an arbitrary channel \I> we have 
= g(Id(g>^)©(l-g)($ ®^). By lemma 1 in with the subadditivity 
of the functions xw®* and x<t>o®^ we obtain 

< qxid^i^) + (i - q)x$om(v) 
< qxidi^) + gx*(a*) + (l - q)x<3>o(°®) + (! - <?)x*(^*) 

= qH(a*) + (1 - g)x* (a*) + X *(a*) = + 

where the last equality follows from the existence of a pwe state ensemble 
on which the maximum in the definition of x$(°"*) is achieved. A 

Due to the MSW representation jH] and subadditivity of the quantum 
entropy the subadditivity of the x-function easily follows from the strong 
superadditivity of the entanglement of formation. What is interesting that 
the converse is also true. The strong superadditivity of the entanglement of 
formation follows from the subadditivity of the x -function for any two chan- 
nels. Together with the corollary 3 it provides another way (as compared to 
[T5|.[T!Z]) of proving that the global additivity conjecture for unconstrained 
channels (or additivity of the entanglement of formation) implies strong su- 
peradditivity of the entanglement of formation. 

The above statement is a consequence of the following theorem. 

Theorem 2. Let $ and \l/ be fixed channels. For given arbitrary 
p G &(H) and g G 6(/C) the additivity conjecture 

C ($ © {p} © M) = {p}) + C(V; {g}) 

holds if and only if 

min V/i fc #($<g)#(a fc )) = min V mH ($(a>) + min S^w j H{^{g j )), 
~f Y,' n 'iPi=p^—*' J2™jej=Q^~^ 

where the first minimum is over all ensembles a^} of states in &(7i®K) 
such that fJ'k&t = P an d ^2 Pk&t = Q- 

k k 
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Proof. The sufficiency of the above condition for the additivity of the 
Holevo capacity for the {p}-constrained channel $ and the {^-constrained 
channel $ obviously follows from the subadditivity of quantum entropy. 

Let us prove the necessity of this condition. The additivity conjecture 
for the {p}-constrained channel $ and {^-constrained channel $ implies 
the existence of nonentangled ensemble with average state p <S> g, which is 
optimal for the {p} ® {^-constrained channel $ ® ^f. By proposition 2 we 
have 

X<s>®*(p®q) = X*(p) + X*(q) > Xt>®*(v) + ^($® V(a)\m P ) ®V(g)) (30) 
for any state a in &(H.) <S> &(JC) such that a* = p and cr* = g. Note that 
S($ <g> tf(<r)||$(p) <8> = #($(p)) + H{V{q)) - H($ <g> *(<t)). (31) 

The inequality (j5D)) with (JUT]) and the definition of the x-function provides 
" > " in the condition of the theorem. Since " < " in this condition is obvious, 
the proof is complete. A 

Corollary 5. The subadditivity of the function x$®* is equivalent 
to the following property: 

X*(^) + XA^) ~ X*8*(*) > H($(<t*)) + H(*(<r*)) - H{$ ® ¥(<t)). 

for all (x G &{H®K). 

This means that the gap between x*( cr *) + X*( cr *) an d X*®*(°") is no 
less than the gap between #($(cr )) + i7(tf(cr*)) and H($ ® *(<r)). 

Proof. The sufficiency of the above property for inequality ()26|) is clear. 

To prove its necessity note that subadditivity of the function x*®* implies 
additivity for the {p}-constrained channel $ and {^-constrained channel \I> 
with any p G S(7Y) and £ G (5(/C). By the definition of the x-function the 
above property can be rewritten as 

min X> fe #($®*(ox)) > min £> i #($(p i ))+ min E ro i^(^(^))- 

k i j 

with p = er* and £ = cr*. But this inequality follows from the theorem 2. A 
Corollary 6. The global subadditivity of the x-function implies strong 

superadditivity of the entanglement of formation. 

Proof. Let Tii, Ti.2, ICi, £2 be finite dimensional Hilbert spaces 

and H = Hi <S> 7^2, K, — K.\® /C 2 . Consider the channels $«(■) = Tr^. (■) 

from ® £<) into ©(%), i = 1, 2. It is clear that $1 g> $ 2 (-) = Ttjc(-)- 
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Let a be an arbitrary state in &(Ti<S>K,), let o"i and a 2 be the partial states 
of a, corresponding to the decomposition H <g> /C = (Hi <£> /Ci) ® (TC 2 <8> /C 2 ). 
By definition we have 

XtM = - £ F (<n), x„> 2 ) = H{Ti K2 {a 2 )) - E F (a 2 ), 

X<s>i®$ 2 ( a ) = H(Tr,c(cr)) - E F (a). 

By the corollary 5 with $ = $ x and $ = $ 2 we obtain the desired strong 
superadditivity of the entanglement of formation: 

E F (a) > E F (a 1 )+ E F (a 2 ). A 

Note also the following lower bound for the Holevo capacity of the 
{p} ® {f?}-constrained channel $ <g) ^f. 

Corollary 7. Let $ and \I/ be an arbitrary channel. For given arbitrary 
states p e and £ G 6(/C) 

(7($ ® {p} ® {<?}) > C(<&; {p}) + (7(tt; {f?}) + S($(p) ® ttfc) ||$ ® *(<r av )), 

where <r av is the average state of any optimal ensemble for the {p} ® {£>}- 
constrained channel $ ® 

Proof. Direct application of the proposition 2. A 
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5 Application of the convex duality 



Corollary 5 of the theorem 2 and the arguments from the convex analysis 
raised in j2] provide the characterization of channels for which subadditivity 
of the x-function holds. For a channel $ and an operator A e Q3 + (7i) we 
introduce the following modified output purity of the channel (cf. pQ) 

u H ($, A) = min [H($(p)) + TrAp] . (32) 

We will show later that the additivity property for this value is equivalent to 
the subadditivity of the x-function. 

Let $ : &(TC) !— > &(T~C') be an arbitrary channel. For a given operator E 
in *B(TC), < E < I n , and numbers q e [0; 1], d e N = {1, 2, . . . } consider 
the channel $(E,q,d) from 23(H) into 58(W) © C d+1 , related to the Shor's 
channel extension <&(E, q, d) by the following equality: 

$(E, q, d){p) = $(E, q, d){p ® r), Vp G 6(H), 

where r is the chaotic classical state rf -1 ] in C d . We have 

$(E,q,d)(p) = (l-q)$(p)®qE(p), 

where E(p) = [Tr/xE, d _1 TYEp, <i _1 Tr.Ep] is a channel from *B(7i) into 
C d+1 . The channel $(E,q,d) was originally introduced by Shor in ^S] with 
the aim to prove that additivity of the minimum output entropy for any pair 
of channels implies additivity of the entanglement of formation. We will show 
that subadditivity of the function x$®* is closely connected to additivity of 
the minimum output entropy for the channels <3> and 

Definition 2. We say that additivity of the minimum output entropy 
holds asymptotically for the sequences of channels {$„ : &(H) i— > &(H' n )} 
and : 6(/C) i-> &{K' m )} if 

lim min H($ n <g> ^ m (a)) 

n,m^+oo cre&(H<SlK) 

= lim min H($> n (p)) + lim min H(^ m (g)), 
assuming that the limits exist and are finite. 
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Note that if additivity of the minimum output entropy holds for the 
channels $ n and \l/ m for all sufficiently large n and m it obviously holds 
asymptotically for the sequences {$ n } and {\l/ m }. 

As in the case of Shor's channel extension it is convenient to denote by 
$d{A,p) the channel <t(A,p/ log d,d). 

Theorem 3. Let $ : &{H) i-> &{W) and : 6(/C) h+ ©(£') be 
arbitrary fixed channels. The following statements are equivalent: 

(i) The function x$<g>* is subadditivite; 

(ii) For all A e <B+(H) and B G <B+(/C) 

v a ($ ® ^ <g> / + / (8) B) = v H ($, A) + i/ H B) ; 

(iii) The additivity of the minimum output entropy holds asymptotically 
for the sequences of channels {$d(A,p)} deN and {^/ e (B, r)} egN with 
arbitrary pairs (Ap) and (B,r). 

Proof, (i) (ii) Note that the function 

H*(p) = min J2mH($(pi)) 

i I 

can be considered as the convex closure of the function H$(p) = H(<&(p)), 
defined on the set of pure states 0, |Zj- The conjugate function is defined 
on the set tBhiTi.) of all hermitian operators by 

H%{p){X) = max [TrXp - H*(p)] . 
p 

The inequality in the corollary 5 can be regarded as the strong superadditivity 
of the function H$(p). By lemma 1 in = 2J this superadditivity is equivalent to 
the subadditivity of the conjugate function if| with respect to the Kronecker 
sum: 

H&A ®I K + I H ®B)< H%{A) + HUB), VA e <B h (H), V5 e 
By the definition of the last inequality is equivalent to 
max \TiA(x* + TiBa* -H($®V(a))] 

< max [TzAp-H{®{p))] + max [FtBq - #(*(<?))] 
pee(H) qe6(K) 
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for all A G <& h {H) and B G <B h (!C). 

Noting that " > " in the previous inequality is obvious and using the 
invariance of it after changing A and B on A + and B + ||.B||i/c 

correspondingly we obtain that (i) (ii). 

(ii) (iii) It is necessary to obtain the expressions for H($ d (A,p)(p)) , 
H(^ e (B,r)(g)) and H($ d (A,p) ® V e (B,r)(a)) for arbitrary p G 6(H), 
g G 6(/C) and a G &(H® K). For this aim we will use the following simple 
lemma. 

Lemma 2. Let {$j}™ =1 be a collection of channels from &(Ti) into 
&(Tlj), {gj}™ =1 be a probability distribution. Then for the channel $ = 
©J =1 fc$j from 6(H) into 6(©" =1 W$) one has 

n 

ff(*M) = + G 6(H). 

Applying this lemma to the channels <3>d(A,p) = (1 — g')$ + q'A and 
\l/ e (_B, r) = (1 — q")ty + g"B (where </ = pj logrf and g" = r / loge) we obtain 

H($ d (A,p)(p)) = h 2 (q') + (l-q')H(tZ>(p))+pTrAp + q'h 2 (TrAp), 
ff(* e (5,r)(e)) = h 2 (q / ') + (l-q")H(^(g))+rTrBg + q / 'h 2 (TrBg). 

The above expressions imply existence of the limits: 

lim min H($ d (A,p)(p)) = min \H($(p)) + pTrAp] , (33) 
d^+oo P e&{H) peeOH) 

lim rrdr i H($ e (B,r)(g)) = min [H{9(q)) + rTrBg] . (34) 

e^+oo e e6(/C) 

Let us prove that 

„ lim . ™ n H@d(A,p) <g) * e (S,r)(<r)) 

= min [#($®#(a)) + TrA7* + Tr J B ( j ,3/ l . 

Due to the representation 

®d(A,p) <g> * c (S,r) = (1 - - <f ) $ ® * 
+q'(l - q")A <g> * + (1 - </)<?"$ <g> B + gV'A <g> B 
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(35) 



(36) 



the lemma 2 reduces the calculation of the value H($ d (A,p) (g) ty e (B,r)(a)) 
to the calculation of four entropies, the first of which is <g> \&(cr)), while 
the fourth does not exceed log(d + 1) + log(e + 1). To calculate the others 
note that 

A ® *(rr) = [^(a), tT 1 ^^), cT 1 * a(<t)], 

where ^xC) = Tr^ (X <g> 1/e • (Id <g> ^ r )(-)) is a completely positive map from 
<S{H®K) into »+(£') (X = A,i4 and Id is an identity map on 6(H)). 
Therefore, 

if (A <g> #(<r)) = #(*a(^)) + #(*a(<t)) + logrfTr^(a) 

(37) 

= F(<t\A,V) + logdTrAr , 

where F(cr|A, \&) does not depend on d and does not exceed 2 log dim K' . 
Similarly, 

#($ <g> B(<r)) = G(tr|B, $) + logeTrSfx' 1 ', (38) 

where G(cx|-B, $) does not depend on e and does not exceed 2 log dim H'. 
The lemma 2 and the representation (|36|) imply 

iJ(i,(Ap)®^e(5,r)(a))=i7({gV^gXl-/)^ ,, (l-9'),(l-9')(l-'? /, )}) 

+ (1 - - </')#($ ® + (1 - <f )pTrAa* + (1 - q^mS** 

+ (1 - g ")g'F(<r|A, *) + (1 - if)q"G(<T\B, $) + q'q"H(A ® B(tr)). 
The estimates for i 7 ", G and 

(A ® B(<r)) < Mlog^ + l^+Me + l))^ 

log a log e 

imply that the right side of the above expression tends to 

H($ ® *(<t)) + TrAx* + TrSo-* 

uniformly in a when d and e tend to infinity. This proves (|35p. 

The expressions (}33j) . (}3lj) and (}35|) show that the statement (iii) is equiv- 
alent to the equality 

min \H($ ® *(<r)) + »TrAa* + rTrEa*] 

o-ee(W®/c) J 

= min L£f($(p)) +pTrAp] + min \HC&(q)) + rTiBg] 

p£6(H) QE6QC) 
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for arbitrary nonnegative numbers p, r and operators A, B such that 
< A < I-H, < B < lie- But this is exactly the statement (ii). A 

The concavity argument shows that the minimum in (}3*2*j) is achieved on 
a pure state. So, the statement (ii) of the above theorem characterizes the 
subadditivity of the x -function for the particular channels in terms of action 
of these channels and its tensor product on pure states. 

Note also that (ii) implies additivity of the minimum output entropy (the 
case A = B = 0). 

Corollary 8. Additivity of the minimum output entropy for the channels 
$d(A,p) and ^ e (B, r) with arbitrary pairs (A,p) and (B, r) for all sufficiently 
large d and e implies subadditivity of the function 

Corollary 9. Global additivity of the minimum output entropy is equiv- 
alent to the global subadditivity of the \ -function. 

Proof. By the corollary in previous section, subadditivity of the function 
X$®-z> implies additivity of the minimum output entropy for the channels $ 
and 

If additivity of the minimum output entropy holds for any pair of channels 
then, by corollary 8, the x-function is globally subadditive. A 

Corollary 10. The asymptotic additivity of the minimum output en- 
tropy for the sequences of channels {$d(A,p)}deN and {^f e (B, r)} e£ N with 
arbitrary pairs (A,p) and {B,r) is equivalent to the asymptotic additiv- 
ity of the Holevo capacity for the sequences of channels {$d(.A,p)} deN and 
{ty e (B, r)} egN with arbitrary pairs (A,p) and (B,r). 

Proof. By theorems 1 and 3 both properties are equivalent to subaddi- 
tivity of the function x*®*- 

A 

Remark. The proof of theorem 3 can be modified to obtain a direct proof 
of the fact that additivity of the minimum output entropy for any pair of 
channels implies the strong subadditivity of the entanglement of formation 
(instead of simple additivity as in J3|). To see this consider channels $ 
and \I/ of the form of the partial traces, as in the proof of the corollary 6. 
The arguments in the proof of theorem 3 shows that assumed additivity of 
the minimum output entropy for the channels <&d{A,p) and ty e (B,r) implies 
superadditivity of the function i7$ ^(cr), which in this case coincides with 
the entanglement of formation Ep(o~). 
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6 Some relations for the relative entropy 



The proofs of the theorem 2 and corollary 7 were based on proposition 2. 
In this section we provide direct proofs of them considering some interesting 
relations for the relative entropy. 

Let a and q be arbitrary states such that supper C supp^. The functions 
f(x) = S(xa + (1 — and g(x) = S(<;\\xa + (1 — x)q) are obviously 

continuous and convex on [0; l} 2 . 

Proposition 5. These functions are related by the following transfor- 
mations: 

X 

g(x) = xf'(x) - f(x), f(x) = x J ^-dt, Vx G [0; 1} 

o 

with the "initial conditions" /(0) = g(0) = and 

f (0) = ^S(xa + (1 - x)^)\ x=0 = 0. (39) 

The proof of this proposition and some features of the above transforma- 
tions can be found in the Appendix. 

Note that in the case supper = supp^ proposition 5 implies 

f (1) = ^S{xa + (1 - x)4^\ x=1 = S(a\\q) + S{<;\\a). (40) 

We will use proposition 5 with $ (g> ^(cr) and $(er*) (g> ^(cr*) in the role 
of er and <j correspondingly. Taking into account the definition of the partial 
trace one can verify that supp $ <8> ^(c) Q supp $(er*) <g> ^(cr*) (Vcr). In this 
case 

f(x) = #($(<7*)) + H(V(a*)) - H(x§ ® #(<r) + (1 - x)$(er*) ® (<r*)). 

The equality may be used for proving theorem 2 while (j4T7|) provides the 
statement of the corollary 7. 

Let {7Tj, pj} and {/ctTj, Qj} be the optimal ensembles for the {p}-constrained 
channel $ and the {^}-constrained channel \I> (with the average states p and 
g) correspondingly. Consider the mixture of an arbitrary ensemble {/x^, er^} 

2 The symbol } means ) if supper C supp<; and ] if supp^ = supper 
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(with the average state cr av ) and the ensemble {iTiZJj, Pi®Qj} with the weights 
x and 1 — x correspondingly. This new ensemble has the average state 
£Oav + (1 — x)p® g E {p} ® {g} and the Holevo quantity 

X%®y = H(<$> ® ^(xa av + (1 - x)p (8) - xJ2 VkH(<& <g> #(<7fc)) 

-(i-x) (E*i#(*te)) + E^#(*G?;) 

= X*(P) + X*(f?) + H{$ <8> ^(xa av + (1 - ar)p ® <?)) - # ($(p)) - #(*(£?)) 

+x ( E vr^($( A )) + E ^M^iQi) ~ E ^k#(* ® 

Denoting the quantity in the last brackets by A we obtain 

K x ) = Xl®y - xl®y = xA - f(x), (41) 

where f(x) — H($(p)) +H(^f(g)) -#(<!><g) ^(xcr av + (1 — x)p<g> q)) is concave 
function with /'(0) = by proposition 5. If the value A is positive, then, 
due to /'(0) = 0, we will necessarily have > X*®^ f° r sufficiently small 

x . If the value A is not positive, then x%>®^ < X%®^ f° r & U £ > 0. This 
observation proves the theorem 2. 

Consider the above construction in the case where 0"^} is an optimal 
ensemble for the {p}(g> {^-constrained channel In this case proposition 

1 implies supp $ (g> ^(cr av ) = supp $(p) (g> $(f?). Hence, by proposition 5, the 
function f(x) is defined on [0, 1] and 

/'(I) = 5($ <g> *(<7 w )||$(p) (g) + S($(p) (g) <g) *(a av )). (42) 

Due to optimality of the above ensemble the concave function h(x), defined 
by ([41)1. must be nondecreasing on [0, 1] and hence h'(x) = A — f'(x) > 0. 
By concavity, this implies f'{l) < A. But in this case 

A = E mH($(j>i)) + E KjH^iQj) - E HkH($ ® (a*)) 

i j k 

= X*®*(a) - xM - XM + S(Q ® (cr av )||$(p) ® (g)) 
= C($ (8 VP; {p} (8 {g}) - (7(d>; {p}) - {g}) 
+5($®*( < 7 lw )||$(p)®*( e )). 
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Taking into account (|42j) we obtain that the inequality /'(l) < A is equivalent 
to the inequality in the corollary 7. 

7 Appendix 

For proving the proposition 5 it is sufficient to show that f'{0) = and 
f'(x) = (f(x) + g(x))/x for x > 0. By definition 

f(x ) = £ [Tr(xa + (1 - x)q) \og(xa + (1 - x)q)} \ X=XQ 
[Tr(xa + (1 - x)q) log?] \ X=XQ = Tr(cr - q) \og(xa + (1 - a:)?) 

(43) 

[Tr(x <r + (1 - x )«?) log(xcr + (1 - x)q)] \ x=Xo - Tr(a - q) logq 

= Tr(a - <;)(log(xcr + (1 - x)q) - log?), 

where we use 4- [Tr(xocr + (1 — xo)q) log(xcr + (1 — x)q)] \ x=Xo = due to the 
fact that Xq is the maximum point of the function 

x i— > Ty(x <t + (1 — x )q) log(xcr + (1 — x)q). 

Expression (|4^j) with xq = gives /'(0) = 0. By the definition 

f(x) + g(x) = xTr(cr - q)(\og(xa + (1 - x)q) - logq). 

Comparing this with ()43j) completes the proof. A 

The transformation f(x) \-> g(x) = xf'(x) — f(x) has simple geometric 
meaning and can be represented as g{x) = f*(f'(x)), where /* is the Legendre 
transform of / j7j . The eigenvectors of this transformation are powers x a with 
corresponding eigenvalues 1 — a. 

The inequality for relative entropy S(a'\\a") > Ma' — a"\\ 2 [TU] shows 
that ^ 

f(x) > cx 2 , g(x) > cx 2 , c = -\\a — q\\ 2 . 

It is interesting to note that the "bound" cx 2 is (essentially unique) fixed 
point of the above transformations. 

Proposition 5 implies that if supper c supp? then 

l 

g(x) — > +oo as x — > 1, but / g{x)dx < +oo. 

o 
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Note also the following relations for the derivatives 



:g(x)\x=o = (n- 1)— -f{x)\ x=0 , n e N. 



dx n dx n ' 

It implies that f(x) — g(x) = o(x 2 ) in a neighborhood of zero and its sign is 
defined by the sign of the third derivative at zero of f(x). 
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